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METHODS AND COMPOSITIONS FOR THE CORRELATION OF SINGLE 
NUCLEOTIDE POLYMORPHISMS IN THE VITAMIN K EPOXIDE REDUCTASE 

GENE AND WARFARIN DOSAGE 

Statement of Priority 

The present application claims the benefit, under 35 U.S.C. § 1 19(e), of U.S. 
Provisional Application Serial No. 60/505,527, filed September 23, 2003, the entire 
contents of which are incorporated by reference herein. 

Government Support 

The present invention was made, in part, with the support of grant numbers 
5P01 HL06350-42 and 5-R01 HL48318 from the National Institutes of Health. The 
United States Government has certain rights to this invention. 

Field of the Invention 

The present invention concerns isolated nucleic acids, host cells containing 
the same, and methods of use thereof, as well as methods and compositions 
directed to identification of single nucleotide polymorphisms (SNPs) in the Vitamin K 
epoxide reductase (VKOR) gene and their correlation with sensitivity to warfarin. 

Background of the Invention 

The function of numerous proteins requires the modification of multiple 
glutamic acid residues to y-carboxyglutamate. Among these vitamin K-dependent 
(VKD) coagulation proteins, FIX (Christmas factor), FVII, and prothrombin are the 
best known. The observation that a knock-out of the gene for matrix Gla protein 
results in calcification of the mouse's arteries (Luo et al. (1997) "Spontaneous 
calcification of arteries and cartilage in mice lacking matrix GLA protein" Nature 
386:78-81) emphasizes the importance of the vitamin K cycle for proteins with 
functions other than coagulation. Moreover, Gas6 and other Gla proteins of 
unknown function are expressed in neural tissue and warfarin exposure in utero 
results in mental retardation and facial abnormalities. This is consistent with the 
observation that the expression of VKD carboxylase, the enzyme that accomplishes 
the Gla modification, is temporally regulated in a tissue-specific manner with high 
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expression in the nervous system during early embryonic stages. Concomitant with 
carboxylation, reduced vitamin K, a co-substrate of the reaction, is converted to 
vitamin K epoxide. Because the amount of vitamin K in the human diet is limited, 
vitamin K epoxide must be converted back to vitamin K by vitamin K epoxide 

5 reductase (VKOR) to prevent its depletion. Warfarin, the most widely used 

anticoagulation drug, targets VKOR and prevents the regeneration of vitamin K. The 
consequence is a decrease in the concentration of reduced vitamin K, which results 
in a reduced rate of carboxylation by the y-glutamyl carboxylase and in the 
production of undercarboxylated vitamin K-dependent proteins. 

10 In the United States alone, warfarin is prescribed to more than one million 

patients per year and in Holland, it has been reported that approximately 2% of the 
population is on long term warfarin therapy. Because the dose of warfarin required 
for a therapeutic level of anticoagulation varies greatly between patients, the 
utilization of warfarin is accompanied by a significant risk of side effects. For 

15 example, it has been reported that following initiation of warfarin therapy, major 

bleeding episodes occurred in 1-2% of patients and death occurred in 0.1-0.7 % of 
patients. In spite of the dangers, it has been estimated that warfarin use can prevent 
20 strokes per induced bleeding episode and is probably underutilized because of 
the fear of induced bleeding. 

20 The present invention overcomes previous shortcomings in the art by 

providing methods and compositions for correlatingsingle nucleotide polymorphisms 
in a subject with an increased or decreased sensitivity to warfarin, thereby allowing 
for more accurate and rapid determination of therapeutic and maintenance doses of 
warfarin at reduced risk to the subject. 

25 

Summary of the Invention 

The present invention provides a method of identifying a human subject 
having increased or decreased sensitivity to warfarin, comprising detecting in the 
subject the presence of a single nucleotide polymorphism in the VKOR gene, 
30 wherein the single nucleotide polymorphism is correlated with increased or 
decreased sensitivity to warfarin, thereby identifying the subject having increased or 
decreased sensitivity to warfarin. 

Additionally provided is a method of identifying a human subject having 
increased or decreased sensitivity to warfarin, comprising: a) correlating the 
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presence of a single nucleotide polymorphism in the VKOR gene with increased or 
decreased sensitivity to warfarin; and b) detecting the single nucleotide 
polymorphism of step (a) in the subject, thereby identifying a subject having 
increased or decreased sensitivity to warfarin. 
5 In a further embodiment, the present invention provides a method of 

identifying a single nucleotide polymorphism in the VKOR gene correlated with 
increased or decreased sensitivity to warfarin, comprising: 

a) identifying a subject having increased or decreased sensitivity to warfarin; 

b) detecting in the subject the presence of a single nucleotide polymorphism 
10 in the VKOR gene; and 

c) correlating the presence of the single nucleotide polymorphism of step (b) 
with the increased or decreased sensitivity to warfarin in the subject, thereby 
identifying a single nucleotide polymorphism in the VKOR gene correlated with 
increased or decreased sensitivity to warfarin. 

15 In addition, the present invention provides a method of correlating a single 

nucleotide polymorphism in the VKOR gene of a subject with increased or decreased 
sensitivity to warfarin, comprising: a) identifying a subject having increased or 
decreased sensitivity to warfarin; b) determining the nucleotide sequence of the 
VKOR gene of the subject of (a); c) comparing the nucleotide sequence of step (b) 

20 with the wild type nucleotide sequence of the VKOR gene; d) detecting a single 
nucleotide polymorphism in the nucleotide sequence of (b); and e) correlating the 
single nucleotide polymorphism of (d) with increased or decreased sensitivity to 
warfarin in the subject of (a). 

A further aspect of the present invention is an isolated nucleic acid encoding 

25 vitamin K epoxide reductase (VKOR), particularly mammalian (e.g., human, ovine, 
bovine, monkey, etc.) VKOR. Examples include (a) nucleic acids as disclosed 
herein, such as isolated nucleic acids having the nucleotide sequence as set forth in 
SEQ ID NO: 8 or SEQ ID NO: 9; (b) nucleic acids that hybridize to isolated nucleic 
acids of (a) above or the complement thereof (e.g., under stringent conditions), 

30 and/or have substantial sequence identity to nucleic acids of (a) above (e.g., are 80, 
85, 90 95 or 99% identical to nucleic acids of (a) above), and encode a VKOR; and 
(c) nucleic acids that differ from the nucleic acids of (a) or (b) above due to the 
degeneracy of the genetic code, but code for a VKOR encoded by a nucleic acid of 
(a) or (b) above. 
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The term "stringent" as used here refers to hybridization conditions that are 
commonly understood in the art to define the commodities of the hybridization 
procedure. Stringency conditions can be low, high or medium, as those terms are 
commonly know in the art and well recognized by one of ordinary skill. High 

5 stringency hybridization conditions that will permit homologous nucleotide sequences 
to hybridize to a nucleotide sequence as given herein are well known in the art. As 
one example, hybridization of such sequences to the nucleic acid molecules 
disclosed herein can be carried out in 25% formamide, 5X SSC, 5X Denhardt's 
solution and 5% dextran sulfate at 42°C, with wash conditions of 25% formamide, 5X 

10 SSC and 0.1% SDS at 42°C, to allow hybridization of sequences of about 60% 
homology. Another example includes hybridization conditions of 6X SSC, 0.1% SDS 
at about 45°C, followed by wash conditions of 0.2X SSC, 0.1% SDS at 50-65°C. 
Another example of stringent conditions is represented by a wash stringency of 0.3 
M NaCI, 0.03M sodium citrate, 0.1% SDS at 60"70°C using a standard hybridization 

15 assay (see SAMBROOK et al., EDS., MOLECULAR CLONING: A LABORATORY 
MANUAL 2d ed. (Cold Spring Harbor, NY 1989, the entire contents of which are 
incorporated by reference herein). In various embodiments, stringent conditions can 
include, for example, highly stringent (i.e., high stringency) conditions (e.g., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate 

20 (SDS), 1 mM EDTA at 65°C, and washing in 0.1xSSC/0.1% SDS at 68°C), and/or 
moderately stringent (i.e., medium stringency) conditions (e.g., washing in 
0.2xSSC/0.1% SDS at42°C.). 

An additional aspect of the present invention is a recombinant nucleic acid 
comprising a nucleic acid encoding vitamin K epoxide reductase as described herein 

25 operatively associated with a heterologous promoter. 

A further aspect of the present invention is a cell that contains and expresses 
a recombinant nucleic acid as described above. Suitable cells include plant, animal, 
mammal, insect, yeast and bacterial cells. 

A further aspect of the present invention is an oligonucleotide that hybridizes 

30 to an isolated nucleic acid encoding VKOR as described herein. 

A further aspect of the present invention is isolated and purified VKOR (e.g., 
VKOR purified to homogeneity) encoded by a nucleic acid as described herein. For 
example, the VKOR of this invention can comprise the amino acid sequence as set 
forth in SEQ ID NO: 10. 
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A further aspect of the present invention is a method of making a vitamin K 
dependent protein which comprises culturing a host cell that expresses a nucleic 
acid encoding a vitamin K dependent protein in the presence of vitamin K and 
produces a vitamin K dependent protein, and then harvesting the vitamin K 
5 dependent protein from the culture, the host cell containing and expressing a 

heterologous nucleic acid encoding vitamin K dependent carboxylase, and the host 
cell further containing and expressing a heterologous nucleic acid encoding vitamin 
K epoxide reductase (VKOR) and producing VKOR as described herein. 

10 Brief Description of the Drawings 

Figures 1 A-D Comparisons of warfarin dosages in wild type, heterozygous 
and homozygous subjects for SNPs vk 2581, vk3294 and vk4769, as well as a 
comparison of warfarin dosages in wild type and heterozygous subjects for P450 
2Y9. 

1 5 Figure 2. For each of the 1 3 siRNA pools, three T7 flasks containing A549 

cells were transfected and VKOR activity determined after 72 h. The VKOR assay 
used 25 uM vitamin K epoxide. One siRNA pool specific for gene gi:13124769 
reduced VKOR activity by 64%-70% in eight repetitions. 

Figure 3. Time course of inhibition of VKOR activity by the siRNA pool 

20 specific for gi: 131 24769 in A549 cells. VKOR activity decreased continuously during 
this time period while the level of its mRNA decreased rapidly to about 20% of 
normal. 25 uM vitamin K epoxide was used for this assay. The siRNA did not affect 
the activity of VKD carboxylase or the level of lamin A/C mRNA. 

Figure 4. VKOR activity was detected when mGC_1 1276 was expressed in 

25 Sf9 insect cells. ~1x10 6 cells were used in this assay. Reactions were performed 
using 32 uM KO at 30°C for 30 minutes in Buffer D. Blank Sf9 cells served as a 
negative control and A549 cells as a reference. 

Figure 5. Inhibition of VKOR by warfarin. Reactions were performed using 1.6 
mg microsomal proteins made from VKOR_Sf9 cells, 60 uM KO, and various 

30 concentration of warfarin at 30°C for 1 5 minutes in Buffer D. 

Detailed Description of the Invention 

As used herein, "a," "an" or "the" can mean one or more than one. For 
example, "a" cell can mean a single cell or a multiplicity of cells. 
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The present invention is explained in greater detail below. This description is 
not intended to be a detailed catalog of all the different ways in which the invention 
may be implemented, or all the features that may be added to the instant invention. 
For example, features illustrated with respect to one embodiment may be 
5 incorporated into other embodiments, and features illustrated with respect to a 

particular embodiment may be deleted from that embodiment. In addition, numerous 
variations and additions to the various embodiments suggested herein will be 
apparent to those skilled in the art in light of the instant disclosure which do not 
depart from the instant invention. Hence, the following specification is intended to . 

10 illustrate some particular embodiments of the invention, and not to exhaustively 
specify all permutations, combinations and variations thereof. 

The "Sequence Listing" attached hereto forms a part of the instant 
specification as if fully set forth herein. 

The present invention may be carried out based on the instant disclosure and 

15 further utilizing methods, components and features known in the art, including but 
not limited to those described in US Patent No. 5,268,275 to Stafford and Wu and 
US Patent No. 6,531,298 to Stafford and Chang, the disclosures of which are 
incorporated by reference herein in their entirety as if fully set forth herein. 

As used herein, "nucleic acids" encompass both RNA and DNA, including 

20 cDNA, genomic DNA, synthetic (e.g., chemically synthesized) DNA and chimeras of 
RNA and DNA. The nucleic acid may be double-stranded or single-stranded. Where 
single-stranded, the nucleic acid may be a sense strand or an antisense strand. The 
nucleic acid may be synthesized using oligonucleotide analogs or derivatives (e.g., 
inosine or phosphorothioate nucleotides). Such oligonucleotides can be used, for 

25 example, to prepare nucleic acids that have altered base-pairing abilities or 
increased resistance to nucleases. 

An "isolated nucleic acid" is a DNA or RNA that is not immediately contiguous 
with both of the coding sequences with which it is immediately contiguous (one on 
the 5* end and one on the 3' end) in the naturally occurring genome of the organism 

30 from which it is derived. Thus, in one embodiment, an isolated nucleic acid includes 
some or all of the 5' non-coding (e.g., promoter) sequences that are immediately 
contiguous to the coding sequence. The term therefore includes, for example, a 
recombinant DNA that is incorporated into a vector, into an autonomously replicating 
plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which 
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exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced 
by PCR or restriction endonuclease treatment), independent of other sequences. It 
also includes a recombinant DNA that is part of a hybrid gene encoding an additional 
polypeptide sequence. 
5 The term "isolated" can refer to a nucleic acid or polypeptide that is 

substantially free of cellular material, viral material, or culture medium (when 
produced by recombinant DNA techniques), or chemical precursors or other 
chemicals (when chemically synthesized). Moreover, an "isolated nucleic acid 
fragment" is a nucleic acid fragment that is not naturally occurring as a fragment and 

10 would not be found in the natural state. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 
six nucleotides to about 100 nucleotides, for example, about 15 to 30 nucleotides, or 
about 20 to 25 nucleotides, which can be used, for example, as a primer in a PCR 
amplification or as a probe in a hybridization assay or in a microarray. 

15 Oligonucleotides may be natural or synthetic, e.g., DNA, RNA, modified backbones, 
etc. 

Where a particular nucleotide sequence is said to have a specific percent 
identity to a reference nucleotide sequence, the percent identity is relative to the 
re f erence nucleotide sequence. For example, a nucleotide sequence that is 50%, 

20 75%, 85%, 90%, 95% or 99% identical to a reference nucleotide sequence that is 

100 bases long can have 50, 75, 85, 90, 95 or 99 bases that are completely identical 
to a 50, 75, 85, 90, 95 or 99 nucleotide sequence of the reference nucleotide 
sequence. The nucleotide sequence can also be a 100 base long nucleotide 
sequence that is 50%, 75%, 85%, 90%, 95% or 99% identical to the reference 

25 nucleotide sequence over its entire length. Of course, there are other nucleotide 
sequences that will also meet the same criteria. 

A nucleic acid sequence that is "substantially identical" to a VKOR nucleotide 
sequence is at least 80%, 85% 90%, 95% or 99% identical to the nucleotide 
sequence of SEQ ID NO:8 or 9. For purposes of comparison of nucleic acids, the 

30 length of the reference nucleic acid sequence will generally be at least 40 

nucleotides, e.g., at least 60 nucleotides or more nucleotides. Sequence identity can 
be measured using sequence analysis software (e.g., Sequence Analysis Software 
Package of the Genetics Computer Group, University of Wisconsin Biotechnology 
Center, 1710 University Avenue, Madison, Wis. 53705). 
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As is known in the art, a number of different programs can be used to identify 
whether a nucleic acid or amino acid has sequence identity or similarity to a known 
sequence. Sequence identity or similarity may be determined using standard 
techniques known in the art, including, but not limited to, the local sequence identity 

5 algorithm of Smith & Waterman, Adv. Appl. Math. 2, 482 (1981), by the sequence 
identity alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48,443 (1970), by 
the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci. USA 
85,2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 

10 Genetics Computer Group, 575 Science Drive, Madison, Wl), the Best Fit sequence 
program described by Devereux et a/., Nucl. Acid Res. 12, 387-395 (1984), 
preferably using the default settings, or by inspection. 

An example of a useful algorithm is PILEUP. PILEUP creates a multiple 
sequence alignment from a group of related sequences using progressive, pairwise 

15 alignments. It can also plot a tree showing the clustering relationships used to 
create the alignment. PILEUP uses a simplification of the progressive alignment 
method of Feng & Doolittle, J. Mol. Evol. 35, 351-360 (1987); the method is similar to 
that described by Higgins & Sharp, CABIOS 5, 151-153 (1989). 

Another example of a useful algorithm is the BLAST algorithm, described in 

20 Altschul et ai, J. Mol. Biol. 215, 403-410, (1990) and Karlin et a/., Proc. Natl. Acad. 
Sci. USA 90, 5873-5787 (1993). A particularly useful BLAST program is the WU- 
BLAST-2 program that was obtained from Altschul et ai, Methods in Enzymology, 
266, 460-480 (1996). WU-BLAST-2 uses several search parameters, which are 
preferably set to the default values. The parameters are dynamic values and are 

25 established by the program itself depending upon the composition of the particular 
sequence and composition of the particular database against which the sequence of 
interest is being searched; however, the values may be adjusted to increase 
sensitivity. An additional useful algorithm is gapped BLAST as reported by Altschul 
et al. Nucleic Acids Res. 25, 3389-3402. 

30 The CLUSTAL program can also be used to determine sequence similarity. 

This algorithm is described by Higgins etal. (1988) Gene 73:237; Higgins etal. 
(1989) CABIOS 5:151-153; Corpetefa/. (1988) Nucleic Acids Res. 16: 10881-90; 
Huang etal. (1992) CABIOS 8: 155-65; and Pearson etal. (1994) Meth. Mol. Biol. 
24: 307-331. 



8 



WO 2005/030039 PCT/US2004/031481 

In addition, for sequences that contain either more or fewer nucleotides than 
the nucleic acids disclosed herein, it is understood that in one embodiment, the 
percentage of sequence identity will be determined based on the number of identical 
nucleotides in relation to the total number of nucleotide bases. Thus, for example, 
sequence identity of sequences shorter than a sequence specifically disclosed 
herein will be determined using the number of nucleotide bases in the shorter 
sequence, in one embodiment. In percent identity calculations, relative weight is not 
assigned to various manifestations of sequence variation, such as, insertions, 
deletions, substitutions, efc. 

The VKOR polypeptides of the invention include, but are not limited to, 
recombinant polypeptides, synthetic peptides and natural polypeptides. The 
invention also encompasses nucleic acid sequences that encode forms of VKOR 
polypeptides in which naturally occurring amino acid sequences are altered or 
deleted. Preferred nucleic acids encode polypeptides that are soluble under normal 
physiological conditions. Also within the invention are nucleic acids encoding fusion 
proteins in which all or a portion of VKOR is fused to an unrelated polypeptide (e.g., 
a marker polypeptide or a fusion partner) to create a fusion protein. For example, 
the polypeptide can be fused to a hexa-histidine tag to facilitate purification of 
bacterially expressed polypeptides, or to a hemagglutinin tag to facilitate purification 
of polypeptides expressed in eukaryotic cells, or to an HPC4 tag to facilitate 
purification of polypeptides by affinity chromatography or immunoprecipitation. The 
invention also includes isolated polypeptides (and the nucleic acids that encode 
these polypeptides) that include a first portion and a second portion; the first portion 
includes, e.g., all or a portion of a VKOR polypeptide, and the second portion 
includes, e.g., a detectable marker. 

The fusion partner can be, for example, a polypeptide that facilitates 
secretion, e.g., a secretory sequence. Such a fused polypeptide is typically referred 
to as a preprotein. The secretory sequence can be cleaved by the cell to form the 
mature protein. Also within the invention are nucleic acids that encode VKOR fused 
to a polypeptide sequence to produce an inactive preprotein. Preproteins can be 
converted into the active form of the protein by removal of the inactivating sequence. 

The invention also includes nucleic acids that hybridize, e.g., under stringent 
hybridization conditions (as defined herein) to all or a portion of the nucleotide 
sequence of SEQ ID NOS: 1-6. 8 or 9 or their complements. In particular 
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embodiments, the hybridizing portion of the hybridizing nucleic acid is typically at 
least 15 (e.g., 20, 30, or 50) nucleotides in length. The hybridizing portion of the 
hybridizing nucleic acid is at least 80%, e.g., at least 95%, at least 98% or 100%, 
identical to the sequence of a portion or all of a nucleic acid encoding a VKOR 

5 polypeptide. Hybridizing nucleic acids of the type described herein can be used, for 
example, as a cloning probe, a primer (e.g., a PCR primer), or a diagnostic probe. 
Also included within the invention are small inhibitory RNAs (siRNAs) and/or 
antisense RNAs that inhibit the function of VKOR, as determined, for example, in an 
activity assay, as described herein and as is known in the art. 

10 In another embodiment, the invention features cells, e.g., transformed cells, 

that contain a nucleic acid of this invention. A "transformed cell" is a cell into which 
(or into an ancestor of which) has been introduced, by means of recombinant nucleic 
acid techniques, a nucleic acid encoding all or a part of a VKOR polypeptide, and/or 
an antisense nucleic acid or siRNA. Both prokaryotic and eukaryotic cells are 

15 included, e.g., bacteria, yeast, insect, mouse, rat, human, plant and the like. 

The invention also features nucleic acid constructs (e.g., vectors and 
plasmids) that include a nucleic acid of the invention that is operably linked to a 
transcription and/or translation control elements to enable expression, e.g., 
expression vectors. By "operably linked" is meant that a selected nucleic acid, e.g., a 

20 DNA molecule encoding a VKOR polypeptide, is positioned adjacent to one or more 
regulatory elements, e.g., a promoter, which directs transcription and/or translation of 
the sequence such that the regulatory elements can control transcription and/or 
translation of the selected nucleic acid. 

The present invention further provides fragments or oligonucleotides of the 

25 nucleic acids of this invention, which can be used as primers or probes. Thus, in 
some embodiments, a fragment or oligonucleotide of this invention is a nucleotide 
sequence that is at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 
90, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 
800, 850, 900, 1000, 1500, 2000, 2500 or 3000 contiguous nucleotides of the 

30 nucleotide sequence set forth in SEQ ID NO:8 or SEQ ID NO:9. Examples of 
oligonucleotides of this invention are provided in the Sequence Listing included 
herewith. Such fragments or oligonucleotides can be detectably labeled or modified, 
for example, to include and/or incorporate a restriction enzyme cleavage site when 
employed as a primer in an amplification (e.g., PCR) assay. 
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The invention also features purified or isolated VKOR polypeptides, such as, 
for example, a polypeptide comprising, consisting essentially of and/or consisting of 
the amino acid sequence of SEQ ID NO: 10 or a biologically active fragment or 
peptide thereof. Such fragments or peptides are typically at least about ten amino 
acids of the amino acid sequence of SEQ ID NO:10 (e.g., 15, 20, 25, 30, 35, 40, 45, 
50, 55, 60, 65, 75, 85, 95, 100, 125, or 150 amino acids of the amino acid sequence 
of SEQ ID NO:10) and can be peptides or fragment of contiguous amino acids of the 
amino acid sequence of the VKOR protein (e.g., as set forth in SEQ ID NO:10). The 
biological activity of a fragment or peptide of this invention can be determined 
according to the methods provided herein and as are known in the art for identifying 
VKOR activity. The fragments and peptides of the VKOR protein of this invention 
can also be active as antigens for the production of antibodies. The identification of 
epitopes on a fragment or peptide of this invention is carried out by well known 
protocols and would be within the ordinary skill of one in the art. 

As used herein, both "protein" and "polypeptide" mean any chain of amino 
acids, regardless of length or post-translational modification (e.g., glycosylation, 
phosphorylation or N-myristylation). Thus, the term "VKOR polypeptide" includes 
full-length, naturally occurring VKOR proteins, respectively, as well as recombinantly 
or synthetically produced polypeptides that correspond to a full-length, naturally 
occurring VKOR protein, or to a portion of a naturally occurring or synthetic VKOR 
polypeptide. 

A "purified" or "isolated" compound or polypeptide is a composition that is at 
least 60% by weight the compound of interest, e.g., a VKOR polypeptide or antibody 
that is separated or substantially free from at least some of the other components of 
the naturally occurring organism or virus, for example, the cell or viral structural 
components or other polypeptides or nucleic acids commonly found associated with 
the polypeptide. As used herein, the "isolated" polypeptide is at least about 25%, 
50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or more pure (w/w). 
Preferably the preparation is at least 75% (e.g., at least 90% or 99%) by weight the 
compound of interest. Purity can be measured by any appropriate standard method, 
e.g., column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 

Preferred VKOR polypeptides include a sequence substantially identical to all 
or a portion of a naturally occurring VKOR polypeptide. Polypeptides "substantially 
identical" to the VKOR polypeptide sequences described herein have an amino acid 
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sequence that is at least 80% or 85% (e.g., 90%, 95% or 99%) identical to the amino 
acid sequence of the VKOR polypeptides of SEQ ID NO: 10. For purposes of 
comparison, the length of the reference VKOR polypeptide sequence will generally 
be at least 16 amino acids, e.g., at least 20, 25, 30, 35, 40, 45, 50, 75, or 100 amino 
5 acids. 

In the case of polypeptide sequences that are less than 100% identical to a 
reference sequence, the non-identical positions are preferably, but not necessarily, 
conservative substitutions for the reference sequence. Conservative substitutions 
typically include, but are not limited to, substitutions within the following groups: 

10 glycine and alanine; valine, isoleucine, and leucine; aspartic acid and glutamic acid; 
asparagine and glutamine; serine and threonine; lysine and arginine; and 
phenylalanine and tyrosine. 

Where a particular polypeptide is said to have a specific percent identity to a 
reference polypeptide of a defined length, the percent identity is relative to the 

15 reference polypeptide. Thus, for example, a polypeptide that is 50%, 75%, 85%, 
90%, 95% or 99% identical to a reference polypeptide that is 100 amino acids long 
can be a 50, 75, 85, 90, 95 or 99 amino acid polypeptide that is completely identical 
to a 50, 75, 85, 90, 95 or 99 amino acid long portion of the reference polypeptide. It 
can also be a 100 amino acid long polypeptide that is 50%, 75%, 85%, 90%, 95% or 

20 99% identical to the reference polypeptide over its entire length. Of course, other 
polypeptides also will meet the same criteria. 

The invention also features purified or isolated antibodies that specifically bind 
to a VKOR polypeptide of this invention or to a fragment thereof. By "specifically 
binds" is meant that an antibody recognizes and binds a particular antigen, e.g., a 

25 VKOR polypeptide, or an epitope on a fragment or peptide of a VKOR polypeptide, 
but does not substantially recognize and bind other molecules in a sample. In one 
embodiment the antibody is a monoclonal antibody and in other embodiments, the 
antibody is a polyclonal antibody. The production of both monoclonal and polyclonal 
antibodies, including chimeric antibodies, humanized antibodies, single chain 

30 antibodies, bi-specific antibodies, antibody fragments, etc., is well known in the art. 

In another aspect, the invention features a method for detecting a VKOR 
polypeptide in a sample. This method comprises contacting the sample with an 
antibody that specifically binds a VKOR polypeptide or a fragment thereof under 
conditions that allow the formation of a complex between an antibody and VKOR; 
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and detecting the formation of a complex, if any, as detection of a VKOR polypeptide 
or fragment thereof in the sample. Such immunoassays are well known in the art 
and include immunoprecipitation assays, immunoblotting assays, immunolabeling 
assays, ELISA, etc. 

5 The present invention further provides a method of detecting a nucleic acid 

encoding a VKOR polypeptide in a sample, comprising contacting the sample with a 
nucleic acid of this invention that encodes VKOR or a fragment thereof, or a 
complement of a nucleic acid that encodes VKOR or a fragment thereof, under 
conditions whereby a hybridization complex can form, and detecting formation of a 

10 hybridization complex, thereby detecting a nucleic acid encoding a VKOR 

polypeptide in a sample. Such hybridization assays are well known in the art and 
include probe detection assays and nucleic acid amplification assays. 

Also encompassed by the invention is a method of obtaining a gene related to 
(i.e., a functional homologue of) the VKOR gene. Such a method entails obtaining or 

15 producing a detectably-labeled probe comprising an isolated nucleic acid which 
encodes all or a portion of VKOR, or a homolog thereof; screening a nucleic acid 
fragment library with the labeled probe under conditions that allow hybridization of 
the probe to nucleic acid fragments in the library, thereby forming nucleic acid 
duplexes; isolating labeled duplexes, if any; and preparing a full-length gene 

20 sequence from the nucleic acid fragments in any labeled duplex to obtain a gene 
related to the VKOR gene. 

A further aspect of the present invention is a method of making a vitamin K 
dependent protein which comprises culturing a host cell that expresses a nucleic 
acid encoding a vitamin K dependent protein in the presence of vitamin K and 

25 produces a vitamin K dependent protein, and then harvesting the vitamin K 
dependent protein from the culture, the host cell containing and expressing a 
heterologous nucleic acid encoding vitamin K dependent carboxylase, and the host 
cell further containing and expressing a heterologous nucleic acid encoding vitamin 
K epoxide reductase (VKOR) and producing VKOR as described herein. The 

30 expression of the VKOR-encoding nucleic acid and the production of the VKOR 
causes the cell to produce greater levels of the vitamin K dependent protein than 
would be produced in the absence of the VKOR. 

Thus, in some embodiments, the present invention also provides a method of 
producing a vitamin K dependent protein, comprising: 
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a) introducing into a cell a nucleic acid that encodes a vitamin K dependent 
protein under conditions whereby the nucleic acid is expressed and the vitamin K 
dependent protein is produced in the presence of vitamin K, wherein the cell 
comprises a heterologous nucleic acid encoding vitamin K dependent carboxylase 

5 and further comprises a heterologous nucleic acid encoding vitamin K epoxide 
reductase; and 

b) optionally collecting the vitamin K dependent protein from the cell. The 
vitamin K dependent protein that can be produced can be any vitamin K dependent 
protein now known or later identified as such, including but not limited to Factor VII, 

10 Factor IX, Factor X, Protein C, Protein S and prothrombin, in any combination. Any 
host cell that can be transformed with the nucleic acids described can be used as 
described herein, although in some embodiments non-human or even non- 
mammalian host cells can be used. Nucleic acids encoding vitamin K dependent 
carboxylase and nucleic acids encoding vitamin K dependent proteins as described 

1 5 herein are well known in the art and their introduction into cells for expression would 
be carried out according to routine protocols. 

Certain embodiments of this invention are based on the inventors' discovery 
that a subject's therapeutic dose of warfarin for anticoagulation therapy can be 
correlated with the presence of one or more single nucleotide polymorphisms in the 

20 VKOR gene of the subject. Thus, the present invention also provides a method of 
identifying a human subject having increased or decreased sensitivity to warfarin, 
comprising detecting in the subject the presence of a single nucleotide polymorphism 
(SNP) in the VKOR gene, wherein the single nucleotide polymorphism is correlated 
with increased or decreased sensitivity to warfarin, thereby identifying the subject as 

25 having increased or decreased sensitivity to warfarin. 

An example of a SNP correlated with an increased sensitivity to warfarin is a 
G->C alteration at nucleotide 2581 (SEQ ID NO: 12) (in intron 2 of the VKOR gene; 
GenBank accession no. refSNP ID: rs8050894, incorporated by reference herein) of 
the nucleotide sequence of SEQ ID NO:11, which is a reference sequence 

30 encompassing the genomic sequence of SEQ ID NO:8 and approximately 1000 
nucleotides preceding and following this sequence. This sequence can be located 
as having the genome position "human chromosome 16p11.2" or in the physical map 
in the NCBI database as human chromosome 16: 31009700-31013800. 
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Examples of SNPs correlated with a decreased sensitivity to warfarin are a 
T— C alteration at nucleotide 3294 (SEQ ID NO:13) (in intron 2 of the VKOR gene; 
GenBank accession no. refSNP ID: rs2359612, incorporated by reference herein) of 
the nucleotide sequence of SEQ ID NO:11 and a G->A alteration at nucleotide 4769 
5 (SEQ ID NO: 14) (in the 3' UTR of the VKOR gene; GenBank accession no. refSNP 
ID: rs7294, incorporated by reference herein) of the nucleotide sequence of SEQ ID 
NO:11. 

As used herein, a subject having an "increased sensitivity to warfarin" is a 
subject for whom a suitable therapeutic or maintenance dose of warfarin is lower 

10 than the therapeutic or maintenance dose of warfarin that would suitable for a 
normal subject, i.e., a subject who did not carry a SNP in the VKOR gene that 
imparts a phenotype of increased sensitivity to warfarin. Conversely, as used herein, 
a subject having a "decreased sensitivity to warfarin" is a subject for whom a suitable 
therapeutic or maintenance dose of warfarin is higher than the therapeutic or 

15 maintenance dose of warfarin that would suitable for a normal subject, i.e., a subject 
who did not carry a SNP in the VKOR gene that imparts a phenotype of decreased 
sensitivity to warfarin. An example of a typical therapeutic dose of warfarin for a 
normal subject is 35 mg per week, although this amount can vary (e.g., a dose range 
of 3.5 to 420 mg per week is described in Aithal et al. (1999) Lancet 353:717-719). 

20 A typical therapeutic dose of warfarin can be determined for a given study group 
according to the methods described herein, which can be used to identify subjects 
with therapeutic warfarin doses above or below this dose, thereby identifying 
subjects having decreased or increased sensitivity to warfarin. 

Further provided herein is a method of identifying a human subject having 

25 increased or decreased sensitivity to warfarin, comprising: a) correlating the 
presence of a single nucleotide polymorphism in the VKOR gene with increased or 
decreased sensitivity to warfarin; and b) detecting the single nucleotide 
polymorphism of step (a) in the subject, thereby identifying a subject having 
increased or decreased sensitivity to warfarin. 

30 In addition, the present invention provides a method of identifying a single 

nucleotide polymorphism in the VKOR gene correlated with increased or decreased 
sensitivity to warfarin, comprising: a) identifying a subject having increased or 
decreased sensitivity to warfarin; b) detecting in the subject the presence of a single 
nucleotide polymorphism in the VKOR gene; and c) correlating the presence of the 
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single nucleotide polymorphism of step (b) with the increased or decreased 
sensitivity to warfarin in the subject, thereby identifying a single nucleotide 
polymorphism in the VKOR gene correlated with increased or decreased sensitivity 
to warfarin. 

5 Also provided herein is a method of correlating a single nucleotide 

polymorphism in the VKOR gene of a subject with increased or decreased sensitivity 
to warfarin, comprising: a) identifying a subject having increased or decreased 
sensitivity to warfarin; b) determining the nucleotide sequence of the VKOR gene of 
the subject of (a); c) comparing the nucleotide sequence of step (b) with the wild type 

10 nucleotide sequence of the VKOR gene; d) detecting a single nucleotide 

polymorphism in the nucleotide sequence of (b); and e) correlating the single 
nucleotide polymorphism of (d) with increased or decreased sensitivity to warfarin in 
the subject of (a). 

A subject is identified as having an increased or decreased sensitivity to 

1 5 warfarin by establishing a therapeutic or maintenance dose of warfarin for 

anticoagulation therapy according to well known protocols and comparing the 
therapeutic or maintenance dose for that subject with the therapeutic or maintenance 
dose of warfarin for anticoagulation therapy of a population of normal subjects (e.g., 
subjects lacking any SNPs in the VKOR gene correlated with increased or 

20 decreased sensitivity to warfarin) from which an average or mean therapeutic or 
maintenance dose of warfarin is calculated. A subject having a therapeutic or 
maintenance dose of warfarin that is below the average therapeutic or maintenance 
dose of warfarin (e.g., the dose of warfarin that is therapeutic or provides a 
maintenance level for a subject that has a wild type VKOR gene, i.e., lacking any 

25 single nucleotide polymorphisms associated with warfarin sensitivity) is a subject 
identified as having an increased sensitivity to warfarin. A subject having a 
therapeutic or maintenance dose of warfarin that is above the average therapeutic or 
maintenance of warfarin is a subject identified as having a decreased sensitivity to 
warfarin. An average therapeutic or maintenance dose of warfarin for a subject with 

30 a wild type VKOR gene would be readily determined by one skilled in the art. 

The nucleotide sequence of the VKOR gene of a subject is determined 
according to methods standard in the art, and as described in the Examples provided 
herein. For example, genomic DNA is extracted from cells of a subject and the 
VKOR gene is located and sequenced according to known protocols. Single 
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nucleotide polymorphisms in the VKOR gene are identified by a comparison of a 
subject's sequence with the wild type sequence as known in the art (e.g., the 
reference sequence as shown herein as SEQ ID NO:1 1). 

A SNP in the VKOR gene is correlated with an increased or decreased sensitivity 
to warfarin by identifying the presence of a SNP or multiple SNPs in the VKOR gene 
of a subject also identified as having increased or decreased sensitivity to warfarin, 
i.e., having a maintenance or therapeutic dose of warfarin that is above or below the 
average dose and performing a statistical analysis of the association of the SNP or 
SNPs with the increased or decreased sensitivity to warfarin, according to well 
known methods of statistical analysis. An analysis that identifies a statistical 
association (e.g., a significant association) between the SNP(s) (genotype) and 
increased or decreased warfarin sensitivity (phenotype) establishes a correlation 
between the presence of the SNP(s) in a subject and an increased or decreased 
sensitivity to warfarin in that subject. 

It is contemplated that a combination of factors, including the presence of one 
or more SNPs in the VKOR gene of a subject, can be correlated with an increased or 
decreased sensitivity to warfarin in that subject. Such factors can include, but are 
not limited to cytochrome p450 2C9 polymorphisms, race, age, gender, smoking 
history and hepatic disease. 

Thus, in a further embodiment, the present invention provides a method of 
identifying a human subject having increased or decreased sensitivity to warfarin, 
comprising identifying in the subject the presence of a combination of factors 
correlated with an increased or decreased sensitivity to warfarin selected from the 
group consisting of one or more single nucleotide polymorphisms of the VKOR gene, 
one or more cytochrome p450 2C9 polymorphisms, race, age, gender, smoking 
history, hepatic disease and any combination of two or more of these factors, 
wherein the combination of factors is correlated with increased or decreased 
sensitivity to warfarin, thereby identifying the subject having increased or decreased 
sensitivity to warfarin. 

Further provided herein is a method of identifying a human subject having 
increased or decreased sensitivity to warfarin, comprising: a) correlating the 
presence of a combination of factors with an increased or decreased sensitivity to 
warfarin, wherein the factors are selected from the group consisting of one or more 
single nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 
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2C9 polymorphisms, race, age, gender, smoking history, hepatic disease and any 
combination of two or more of these factors; and b) detecting the combination of 
factors of step (a) in the subject, thereby identifying a subject having increased or 
decreased sensitivity to warfarin. 
5 In addition, the present invention provides a method of identifying a 

combination of factors correlated with an increased or decreased sensitivity to 
warfarin, wherein the factors are selected from the group consisting of one or more 
single nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 
2C9 polymorphisms, race, age, gender, smoking history, hepatic disease and any 

10 combination of two or more of these factors, comprising: a) identifying a subject 

having increased or decreased sensitivity to warfarin; b) detecting in the subject the 
presence of a combination of the factors; and c) correlating the presence of the 
combination of factors of step (b) with the increased or decreased sensitivity to 
warfarin in the subject, thereby identifying a combination of factors correlated with 

1 5 increased or decreased sensitivity to warfarin. 

Also provided herein is a method of correlating a combination of factors, 
wherein the factors are selected from the group consisting of one or more single 
nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 2C9 
polymorphisms, race, age, gender, smoking history, hepatic disease and any 

20 combination of two or more of these factors, with increased or decreased sensitivity 
to warfarin, comprising: a) identifying a subject having increased or decreased 
sensitivity to warfarin; b) identifying the presence of a combination of the factors in 
the subject; and c) correlating the combination of the factors of (b) with increased or 
decreased sensitivity to warfarin in the subject of (a). 

25 A combination of factors as described herein is correlated with an increased 

or decreased sensitivity to warfarin by identifying the presence of the combination of 
factors in a subject also identified as having increased or decreased sensitivity to 
warfarin and performing a statistical analysis of the association of the combination of 
factors with the increased or decreased sensitivity to warfarin, according to well 

30 known methods of statistical analysis. An analysis that identifies a statistical 

association (e.g., a significant association) between the combination of factors and 
the warfarin sensitivity phenotype (increased or decreased) establishes a correlation 
between the presence of the combination of factors in a subject and an increased or 
decreased sensitivity to warfarin in that subject. 
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Further provided herein are nucleic acids encoding VKOR and comprising one 
or more SNPs as described herein. Thus, the present invention further provides 
nucleic acids comprising, consisting essentially of and/or consisting of the nucleotide 
sequence as set forth in SEQ ID NOs:12, 13, 14, 15 and 16. The nucleic acids can 
5 be present in a vector and the vector can be present in a cell. Further included are 
proteins encoded by a nucleic acid comprising a nucleotide sequence as set forth in 
SEQ ID NOs:12, 13, 14, 15 and 16, as well as antibodies that specifically bind a 
protein encoded by a nucleic acid comprising a nucleotide sequence as set forth in 
SEQ ID NOs:12, 13, 14, 15 and 16. The present invention is more particularly 
10 described in the following examples that are intended as illustrative only since 

numerous modifications and variations therein will be apparent to those skilled in the 
art. 



EXAMPLES 

15 

EXAMPLE I CORRELATION BETWEEN SNPS IN VKOR GENE AND 
INCREASED OR DECREASED SENSITIVITY TO WARFARIN 

The most prevalent isoform of the VKOR gene is about 4 kb long, has three 
exons and encodes an enzyme of 163 amino acids with a mass of 18.4 kDa. In the 

20 present study, three mutations vk2581 (G>C), vk3294(T>C) and vk4769(G>A), 

identified as SNPs (heterozygosity ratios of 46.9%, 46.8% and 46.3%, respectively) 
were examined for a correlation between their presence in a subject and the 
maintenance dose of warfarin required to achieve a therapeutically effective 
response. 

25 1 . Selection of subjects 

Subjects were obtained from the UNC Coagulation Clinic in the Ambulatory 
Care Center. Informed consent was obtained by a trained genetic counselor. 
Subjects not fluent in English were excluded because of the lack of translators and 
the requirement for consent. To qualify for the study, subjects had warfarin for at 

30 least six months, were older than 18 and were followed by the UNC Coagulation 
clinic at the Ambulatory Care Clinic. 
2. Extraction of genomic DNA from whole blood 
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Genomic DNAs were extracted from the whole blood of subjects using 
QIAamp DNA Blood Mini Kit (QIAGEN cat#51104). The DNA concentration was 
adjusted to 10 ng/pL. 

3. Sequencing of the genomic DNA samples 
5 Approximately 1 0 ng of DNA was used for polymerase chain reaction (PCR) 

assays. The primers used to amplify the VKOR gene were: Exon 1-5' 
CCAATCGCCGAGTCAGAGG (SEQ ID NO:29) and Exon 1-3' 
CCCAGTCCCCAGCACTGTCT (SEQ ID NO:30) for the 5-UTR and Exon 1 region; 
Exon 2-5' AGGGGAGGATAGGGTCAGTG (SEQ ID NO:31) and Exon 2-3' 

1 0 CCTGTTAGTTACCTCCCCACA (SEQ ID NO:32) for the Exon 2 region; and Exon 3- 
5" ATACGTGCGTAAGCCACCAC (SEQ ID NO:33) and Exon 3-3' 
ACCCAGATATGCCCCCTTAG (SEQ ID NO:34) for the Exon3 and 3'-UTR region. 
Automated high throughput capillary electrophoresis DNA sequencing was used for 
detecting SNPs in the VKOR gene. 

15 4. Detection of known SNPs using real-time PCR 

The assay reagents for SNP genotyping were from the Assay-by-Design™ 
service (Applied Biosystems, cat#4332072). The primers and probes (FAM™ and 
VIC™ dye-labeled) were designed using Primer Express software and were 
synthesized in an Applied Biosystems synthesizer. The primer pairs for each SNP 

20 are located at the upstream/downstream position of the SNP site and can generate 
less than 100 bp length of a DNA fragment in the PCR reaction. The FAM™ and 
VIC™ dye-labeled probes were designed to cover the SNP sites with a length of 15- 
16 nt. The primer and probe sequences for each VKOR SNP are shown in Table 2. 
The 2X TaqMan™ Universal PCR Master Mix, No AmpErase UNG (Applied 

25 Biosystems, cat#4324018) was used in the PCR reactions. Forty cycles of real-time 
PCR were performed in an Opticon II (MJ Research) machine. There was a 10 
minute 95°C preheat followed by 92°C for 15 sec, 60°C for 1 min. and then a plate 
reading. The results were read according to the signal value of FAM and VIC dye. 
5. Statistical analysis 

30 The difference of average dose between different genotypes was compared 

by analysis of variance (ANOVA) using SAS version 8.0 (SAS, Inc., Cary, NC). A 
two-sided p value less than 0.05 was considered significant. Examination of the 
distribution and residuals for the average dose of treatment among the SNP groups 
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indicated that a log transformation was necessary to satisfy the assumption of 

homogeneity of variance. 

6. Correlation of SNPs with warfarin dosage 

By direct genomic DNA sequencing and SNP real-time PCR detection, five 
SNPs were identified in the VKOR gene: one in the 5'-UTR, two in intron II, one in 
the coding region and one in the 3-UTR (Table 1). 

Among these SNPs, the vk563 and vk4501 SNPs allele were carried by only 
one of the 58 subjects of the study (a triple heterozygous, also carrying the 3'-UTR 
SNP allele), while the other SNPs were identified in 17-25 heterozygous patients. 

Each marker was first analyzed independently. Figure 1 A shows that the 
average warfarin dose for patients with the vk2581 wild type allele was 50.19±3.20 
mg per week (n=26), while those heterozygous and homozygous for this 
polymorphism were 35.19±3.73 (n=17) and 31.14±6.2 mg per week (n=15), 
respectively. Figure 1 B shows that the average warfarin dose for patients with the 
wild-type vk3294 allele was 25.29±3.05 mg per week (n=11), while patients bearing 
the heterozygous and homozygous alleles were 41.68±4.92 (n=25) and 47.73±2.75 
mg per week (n=22), respectively. Figure 1C shows the average warfarin dose for 
patients with vk4769 SNP wild type was 35.35±4.01 mg per week (n=27), while 
patients with the heterozygous and homozygous alleles required 44.48±4.80 (n=19) 
and 47.56±3.86 mg per week (n=12), respectively. It was also observed that P450 
2C9 *3 has a significant effect on warfarin dose (Figure 1 D), as previously reported 
(Joffe et al. (2004) 'Warfarin dosing and cytochrome P450 2C9 polymorphisms" 
Thromb Haemost 91 :1 123-1 128). The average warfarin dose for patients with P450 
2C9 *1 (wild type) was 43.82±2.75 mg per week (n=50), while patients heterozygous 
for this allele required 22.4±4.34 mg per week (n=8). 
7. Statistical analysis 

The association of the Log e (warfarin average dosage)(LnDose) with the 
SNPs in the VKOR gene was examined by analysis of variance (ANOVA). SAS was 
used first to do a repeated procedure in which a series of factors (race, gender, 
smoking history, hepatic diseases, SNPs at cytochrome P450 2Y9 gene, etc.) were 
examined to identify factors, excluding VKOR SNPs, which might affect dosage. 
P450 2C9 *3 was significantly associated with the average dose of warfarin; thus, it 
was included as a covariant for further analysis. The analysis indicated that the 
three VKOR SNPs were still significantly associated with weekly warfarin dose 
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(vk2581, P < 0.0001; vk3294, P <0.0001; and vk4769, P= 0.0044), when the 
covariance is included. 

To specifically test if the three SNPs of VKOR were independently associated 
with warfarin dosage, the analysis was repeated in which two SNPs in the VKOR 
5 gene were included as covariates for the other SNP. The three VKOR SNPs are 
located within 2 kb distance of one another and are expected to be closely linked. It 
was clear from inspection that, at least for Caucasians, one haplotype (where 
A=vk2581 guanine and a=vk2581 cytosine; B=vk3294 thymine and b=vk3924 
cytosine; C=vk4769 guanine and c=vk4769 adenine) was AAbbcc and another 
10 aaBBCC. The distribution of individual SNPs in patients was found to be significantly 
correlated with the others (R=0.63-0.87, p<0.001). Indeed, subjects with the 
haplotype AAbbcc (n=7) required a significantly higher dosage of warfarin (warfarin 
dosage=48.98±3.93) compared to those patients with haplotype aaBBCC 
(25.29±3.05; p<0.001). 

15 

EXAMPLE 2 siRNA DESIGN AND SYNTHESIS 

siRNAs were selected using an advanced version of a rational design 
algorithm (Reynolds et al. (2004) "Rational siRNA design for RNA interference" 
Nature Biotechnology 22:326-330). For each of the 13 genes, four siRNAs duplexes 

20 with the highest scores were selected and a BLAST search was conducted using the 
Human EST database. To minimize the potential for off-target silencing effects, only 
those sequence targets with more than three mismatches against un-related 
sequences were selected (Jackson et al. (2003) "Expression profiling reveals off- 
target gene regulation by RNAi" Nat Biotechnol 21 :635-7). All duplexes were 

25 synthesized in Dharmacon (Lafayette, CO) as 21-mers with UU overhangs using a 
modified method of 2'-ACE chemistry (Scaringe (2000) "Advanced 5'-silyl-2'- 
orthoester approach to RNA oligonucleotide synthesis" Methods Enzymol 317:3-18) 
and the AS strand was chemically phosphorylated to ensure maximum activity 
(Martinez et al. (2002) "Single-stranded antisense siRNAs guide target RNA 

30 cleavage in RNAi" Cell 1 1 0:563-74). 
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EXAMPLE 3 siRNA transfection 

Transfection was essentially as previously described (Harborth et al. (2001) 
"Identification of essential genes in cultured mammalian cells using small interfering 
RNAs" J Cell Sci 1 14:4557-65) with minor modifications. 

5 

EXAMPLE 4 VKOR activity assay 

siRNA transfected A549 cells were trypsinized and washed twice with cold 
PBS. 1 .5x1 0 7 cells were taken for each VKOR assay. 200 uL buffer D (250 mM 
Na 2 HP04-NaH 2 P0 4 , 500 mM KCI, 20% glycerol and 0.75% CHAPS, pH 7.4) was 

1 0 added to the cell pellet, followed by sonication of the cell lysate. For assays of 

solubilized microsomes, microsomes were prepared from 2x1 0 9 cells as described 
(Lin et al. (2002) "The putative vitamin K-dependent gamma-glutamyl carboxylase 
internal propeptide appears to be the propeptide binding site" J Biol Chem 
277:28584-91); 10 to 50 uL of solubilized microsomes were used for each assay. 

15 Vitamin K epoxide was added to the concentration indicated in the figure legends 
and DTT was added to 4 mM to initiate the reaction. The reaction mixture was 
incubated in yellow light at 30°C for 30 minutes and stopped by adding 500 uL 0.05 
M AgN0 3 : isopropanol (5:9). 500 uL hexane was added and the mixture was 
vortexed vigorously for 1 minute to extract the vitamin K and KO. After 5 minutes 

20 centrifugation, the upper organic layer was transferred to a 5-mL brown vial and 
dried with N 2 - 150 uL HPLC buffer, acetonitrile:isopropanol:water (100:7:2), was 
added to dissolve the vitamin K and KO and the sample was analyzed by HPLC on 
an A C-18 column (Vydac, cat#218TP54). 

25 EXAMPLE 5 RT-qPCR (reverse transcriptase quantitative PCR) 

1x10 6 cells were washed with PBS twice and total RNA was isolated with 
Trizol reagent according to the manufacturer's protocol (Invitrogen). 1 ug of RNA 
was digested by RQ1 DNasel (Promega) and heat-inactivated. First strand cDNA 
was made with M-MLV reverse transcriptase (Invitrogen). cDNAs were mixed with 
30 DyNAmo SYBR Green qPCR pre-mix (Finnzymes) and real-time PCR was 

performed with an Opticon II PCR thermal cycler (MJ Research). The following 
primers were used: 

13124769-5' (F): (TCCAACAGCATATTCGGTTGC, SEQ ID NO: 1); 
13124769-3 (R)': (TTCTTGGACCTTCCGGAAACT, SEQ ID NO: 2); 
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GAPDH-F: (GAAGGTGAAGGTCGGAGTC , SEQ ID NO: 3); 
GAPDH-R: (GAAGATGGTGATGGGATTTC, SEQ ID NO: 4); 
Lamin-RT-F: (CTAGGTGAGGCCAAGAAGCAA, SEQ ID NO: 5) and 
Lamin-RT-R: (CTGTTCCTCTCAGCAGACTGC, SEQ ID NO: 6). 

5 

EXAMPLE 6 Over-expression of VKOR in Sf9 insect cell line 

The cDNA for the mGC1 1276 coding region was cloned into pVL1392 
(Pharmingen), with the HPC4 tag (EDQVDPRLIDGK, SEQ ID NO: 7) at its amino 
terminus and expressed in Sf9 cells as described (Li et al. (2000) "Identification of a 
1 0 Drosophila vitamin K-dependent gamma-glutamyl carboxylase" J Biol Chem 
275:18291-6). 

EXAMPLE 7 Gene selection 

The search for the VKOR gene was focused on human chromosome sixteen 
15 between markers D16S3131 and D16S419. This region corresponds to chromosome 
16 at 50cM-65cM on the genetic map and 26-46.3Mb on the physical map. 190 
predicted coding sequences in this region were analyzed by a BLASTX search of the 
NCBI non-redundant protein database. Those human genes and orthologs from 
related species with known function were eliminated. Because VKOR appears to be 
20 a transmembrane protein (Carlisle & Suttie (1980) "Vitamin K dependent 

carboxylase: subcellular location of the carboxylase and enzymes involved in vitamin 
K metabolism in rat liver" Biochemistry 19:1 161-7), the remaining genes were 
translated according to the cDNA sequences in the NCBI database and analyzed 
with the programs TMHMM and TMAP (Biology WorkBench, San Diego 
25 Supercomputer System) to predict those with transmembrane domains. Thirteen 
genes predicted to code for integral membrane proteins were chosen for further 
analysis. 

EXAMPLE 8 Cell line screening for VKOR activity 

30 The strategy was to identify a cell line expressing relatively high amounts of 

VKOR activity and use siRNA to systematically knock down all thirteen candidate 
genes. siRNA, double stranded RNA of 21-23 nucleotides, has been shown to cause 
specific RNA degradation in cell culture (Hara et al. (2002) "Raptor, a binding partner 
of target of rapamycin (TOR), mediates TOR action" Cell 110:177-89; Krichevsky & 
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Kosik (2002) "RNAi functions in cultured mammalian neurons" Proc Natl Acad Sci 
USA 99:11926-9; Bums et al. (2003) "Silencing of the Novel p53 Target Gene 
Snk/Plk2 Leads to Mitotic Catastrophe in Paclitaxel (Taxol)-Exposed Cells" Mol Cell 
Biol 23:5556-71). However, application of siRNA for large scale screening in 

5 mammalian cells has not previously been reported because of the difficulty in 
identifying a functional target for a specific mammalian cell mRNA (Holen et al. 
(2003) "Similar behaviour of single-strand and double-strand siRNAs suggests they 
act through a common RNAi pathway" Nucleic Acids Res 31:2401-7). The 
development of a rational selection algorithm (Reynolds et al.) for siRNA design 

10 increases the probability that a specific siRNA can be developed; furthermore, the 
probability of success can be increased by pooling four rationally selected siRNAs. 
Using siRNA to search for previously unidentified genes has the advantage that, 
even if VKOR activity requires the product of more than one gene for activity, the 
screen should still be effective because the assay determines the loss of enzymatic 

1 5 activity. 

Fifteen cell lines were screened and a human lung carcinoma line, A549, was 
identified to exhibit sufficient warfarin-sensitive VKOR activity for facile 
measurement. A second human colorectal adenocarcinoma cell line, HT29, which 
expressed very little VKOR activity, was used as a reference. 

20 

EXAMPLE 9 siRNA inhibition of VKOR activity in A549 cells 

Each of the thirteen pools of siRNA were transfected in triplicate into A549 
cells and assayed for VKOR activity after 72 hours. One siRNA pool specific for 
gene gi: 13124769 reduced VKOR activity by 64%-70% in eight separate assays 
25 (Fig. 2). 

One possible reason that VKOR activity was inhibited to only ~35% of its 
initial activity after 72 hours is that the half-life of mammalian proteins varies greatly 
(from minutes to days) (Zhang et al. (1996) 'The major calpain isozymes are long- 
lived proteins. Design of an antisense strategy for calpain depletion in cultured cells" 
30 J Biol Chem 271:18825-30; Bohley (1996) "Surface hydrophobicity and intracellular 
degradation of proteins" Biol Chem 377:425-35; Dice & Goldberg (1975) 
"Relationship between in vivo degradative rates and isoelectric points of proteins" 
Proc Natl Acad Sci USA 72:3893-7), and mRNA translation is being inhibited, not 
enzyme activity. Therefore, the cells were carried through eleven days and their 
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VKOR activity followed. Figure 3 shows that the level of mRNA for gi:1 3124769 
mRNA decreased rapidly to about 20% of normal while VKOR activity decreased 
continuously during this time period. This reduction in activity is not a general effect 
of the siRNA or the result of cell death because the level of VKD carboxylase activity 
and lamin A/C mRNA remained constant. Furthermore, the level of gi: 1321 24769 
mRNA is four fold lower in HT-29 cells, which have low VKOR activity, than in A549 
cells that exhibit high VKOR activity. These data indicate that gi: 131 24769 
corresponds to the VKOR gene. 

EXAMPLE 10 Identification of gene encoding VKOR 

The gene, IMAGE 3455200 (gi:1 3124769, SEQ ID NO: 8). identified herein to 
encode VKOR, maps to human chromosome 16p11.2, mouse chromosome 7F3, 
and rat chromosome 1:180.8 Mb. There are 338 cDNA clones in the NCBI database 
representing seven different splicing patterns (NCBI AceView program). These are 
composed of all or part of two to four exons. Among these, the most prevalent 
isoform, mGC1 1276, has three exons and is expressed at high levels in lung and 
liver cells. This three exon transcript (SEQ ID NO: 9) encodes a predicted protein of 
163 amino acids with a mass of 18.2 kDa (SEQ ID NO: 10). It is a putative N- 
myristylated endoplasmic reticulum protein with one to three transmembrane 
domains, depending upon the program used for prediction. It has seven cysteine 
residues, which is consistent with observations that the enzymatic activity is 
dependent upon thiol reagents (Thijssen et al. (1994) "Microsomal lipoamide 
reductase provides vitamin K epoxide reductase with reducing equivalents" Biochem 
J 297:277-80). Five of the seven cysteines are conserved among human, mice, rat, 
zebrafish, Xenopus and Anopheles. 

To confirm that the VKOR gene had been identified, the most prevalent form 
of the enzyme (three exons) was expressed in Spodoptera frugiperda, Sf9 cells. Sf9 
cells have no measurable VKOR activity but exhibit warfarin sensitive activity when 
transfected with mGC 11276 cDNA (Figure 4). VKOR activity is observed from 
constructs with an epitope tag at either their amino or carboxyl terminus. This tag 
should assist in the purification of VKOR. 

VKOR should exhibit warfarin sensitivity, therefore microsomes were made 
from Sf9 cells expressing VKOR and tested for warfarin sensitivity. The VKOR 
activity is warfarin-sensitive (Figure 5). 
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In summary, the present invention provides the first example of using siRNA 
in mammalian cells to identify an unknown gene. The identity of the VKOR gene was 
confirmed by its expression in insect cells. The VKOR gene encodes several 
isoforms. It will be important to characterize the activity and expression pattern of 
each isoforrn. Millions of people world-wide utilize warfarin to inhibit coagulation; 
therefore it is important to further characterize VKOR as it can lead to more accurate 
dosing or design of safer, more effective, anti-coagulants. 

The foregoing is illustrative of the present invention, and is not to be 
construed as limiting thereof. The invention is defined by the following claims, with 
equivalents of the claims to be included therein. 

All publications, patent applications, patents, patent publications and 
other references cited herein are incorporated by reference in their entireties for the 
teachings relevant to the sentence and/or paragraph in which the reference is 
presented. 
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Table 1 . Five SNPs examined in VKOR gene 



SNPs 


posit 
ion 


AA change 


Heterozygous ratio 


vk563 
G>A 
(SEQ ID 
NO: 15) 


5*- 
UTR 


N/A 


1/58 


vk2581 G>C 
(SEQ ID 
NO:12) 


Intro 
n2 


N/A 


17/58 


vk3294 T>C 
(SEQ ID 
NO:13) 


Intro 
n2 


N/A 


25/58 


vk4501 C>T 
(SEQ ID 
NO:16 


Exo 
n3 


Leu120Leu 


1/58 


vk4769 G>A 
(SEQ ID 
NO:14 


3'- 
UTR 


N/A 


19/58 
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